ABSTRACT

In this paper, sufficient statistics are proposed to measure the amount of information in censored and 
progressively censored data. Under specified life testing experiments, the sufficient statistics are obtained and used to 
identify the relative losses of information and the efficiency due to the censoring schemes. Comparisons between the 
different censoring mechanisms, the main affecting factors in the amount of information and the validity of sufficient 
statistics as measures of information are explored through both theoretical and numerical results. 

KEYWORDS: Progressively Censored Data, Relative Efficiency, Relative Loss of Information, Sufficient Statistic, Type I and Type II Censored Data

1. INTRODUCTION 

In reliability and survival studies, life testing experiments are performed to provide the failure time data required
for statistical inference about the lifetime model. Under normal use conditions most units take a long time to fail,
so obtaining a complete data set is costly and time consuming. To overcome this problem, life testing
experiments frequently incorporate censoring schemes, at the price of a relative loss of data that affects the efficiency of any
statistical modeling.

The most common censoring mechanisms are type I (time) censoring, where the life testing experiment is terminated
at a prescribed time $T > 0$, and type II (failure) censoring, where the experiment is terminated upon the
$r$-th failure, where $r < n$ is a pre-fixed number of failures. Under either type of censoring, progressively censored data
may be obtained when, at various stages of the experiment, some of the surviving units are removed from further
observation.

Statisticians have long endeavored to develop a precise notion of the information in sample data. Fisher
information has been considered by many authors under several censoring schemes, as in [1], [2], [3]. It is also used in
[4] for progressive type II censoring. Fisher information in generalized order statistics is studied by [5]. Many other
related references are given in [6]-[9].

Despite the importance of the Fisher information measure, the differential entropy measure introduced by
Shannon [10] is the one utilized in advanced information theory. It has been used in statistics by many authors, as in [11], [12], [13] and
[14]. For a continuous random variable $X$ with probability density function $f(x)$, it is defined by

$$H(X) = -\int f(x) \log f(x)\, dx \qquad (1)$$

Because Shannon information can be negative in some cases, Awad (1987) suggested a modification of the Shannon
entropy, given by

$$A(X) = -E\left[\log \frac{f(X)}{\Delta}\right], \quad \text{where } \Delta = \sup_x f(x) \qquad (2)$$

This measure is used by [15] for the Pareto distribution with type II censoring and by [16] for obtaining the 
relative efficiency of the information in type I censored sample at time T from the exponential distribution. 
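
As a worked illustration of why the modification matters, the two measures can be evaluated for the exponential density with mean $\lambda$ that the paper introduces in (3). This is a sketch under that assumption; the symbol $\Delta$ for the supremum of the density is an assumed notation.

```latex
% Assumes f(t) = (1/\lambda) e^{-t/\lambda}, t > 0, so that E(t) = \lambda.
\begin{align*}
H(X) &= -\int_0^\infty f(t)\,\log f(t)\,dt
      = \int_0^\infty f(t)\left(\log\lambda + \frac{t}{\lambda}\right)dt
      = \log\lambda + 1, \\
\Delta &= \sup_{t>0} f(t) = \frac{1}{\lambda}, \qquad
A(X) = -E\!\left[\log\frac{f(t)}{\Delta}\right]
     = E\!\left[\frac{t}{\lambda}\right] = 1 .
\end{align*}
```

Under this assumption the Shannon measure is negative whenever $\lambda < 1/e$, while the sup-entropy equals 1 for every $\lambda$.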

A sufficient statistic is a statistic that, in a certain sense, captures all the information about the parameter of interest
contained in the sample data: given the value of the sufficient statistic, the rest of the sample carries no further
information about the parameter.

The aim of this paper is to investigate sufficient statistics for the scale parameter $\lambda$ of the Exp($\lambda$)
distribution, with probability density and reliability functions given respectively by

$$f(t) = \frac{1}{\lambda} \exp\left(-\frac{t}{\lambda}\right), \quad t > 0, \ \lambda > 0 \qquad (3)$$

$$R(t) = \exp\left(-\frac{t}{\lambda}\right), \quad t > 0, \ \lambda > 0 \qquad (4)$$

as a measure of the information in data obtained under different censoring schemes. The exponential
distribution is considered because it plays a basic role in survival and reliability analysis. It is widely used in
the development of life testing theory because of its mathematical tractability, and it belongs to both the exponential and Weibull
families of distributions. More details are found in [17]-[19].

Consider a complete random sample $t_i$, $i = 1, 2, \ldots, n$, from Exp($\lambda$). The likelihood function of $\lambda$ is given by

$$L_{com}(\lambda) = \lambda^{-n} \exp\left(-\frac{\sum_{i=1}^{n} t_i}{\lambda}\right) \qquad (5)$$

Applying the factorization theorem, $S(com) = \sum_{i=1}^{n} t_i$ is a sufficient statistic for $\lambda$.
We define the amount of information in the complete sample as

$$E(S(com)) = E\left(\sum_{i=1}^{n} t_i\right) = n\lambda \qquad (6)$$

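For other censored life testing data with sufficient statistic $S(cen)$, the relative loss of information
due to censoring is defined as

$$R(cen) = \frac{E(S(com)) - E(S(cen))}{E(S(com))} = \frac{n\lambda - E(S(cen))}{n\lambda} \qquad (7)$$

and the relative efficiency of the censoring scheme as

$$eff(cen) = \frac{E(S(cen))}{E(S(com))} = \frac{E(S(cen))}{n\lambda} \qquad (8)$$

Consequently,

$$R(cen) = 1 - eff(cen) \qquad (9)$$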
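
The definitions of relative loss and relative efficiency can be sketched as two small helper functions. The function names and the example values ($n = 20$, $r = 14$, $\lambda = 3$) are illustrative assumptions, not taken from the paper; the example uses the type II result $E(S(cen)) = r\lambda$ derived in section 2.

```python
def relative_efficiency(expected_s_cen: float, n: int, lam: float) -> float:
    """eff(cen) = E(S(cen)) / E(S(com)), where E(S(com)) = n * lam."""
    if n <= 0 or lam <= 0:
        raise ValueError("n and lam must be positive")
    return expected_s_cen / (n * lam)


def relative_loss(expected_s_cen: float, n: int, lam: float) -> float:
    """R(cen) = 1 - eff(cen)."""
    return 1.0 - relative_efficiency(expected_s_cen, n, lam)


# Hypothetical example: type II censoring with r = 14 of n = 20 failures
# observed, for which E(S(cen)) = r * lam.
n, r, lam = 20, 14, 3.0
eff = relative_efficiency(r * lam, n, lam)
loss = relative_loss(r * lam, n, lam)
```

Both quantities depend on the censoring scheme only through $E(S(cen))$, so the same helpers apply to every scheme considered in the later sections.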

The relative loss of information and efficiency are obtained for type II censored data in section 2, for type I
censored data in section 3, and for progressively type I and progressively type II censored data in sections 4 and 5 respectively.
In section 6 numerical results illustrating the theoretical findings are presented. The conclusions of the paper are given in
section 7.

2. TYPE II CENSORED DATA

Suppose that $n$ units are placed on a life testing experiment and the experiment is terminated on observing the
$r$-th failure, where $r < n$ is a pre-fixed known constant. Hence, the only observable failure times are $t(1) < t(2) < \cdots < t(r)$,
and the remaining $(n - r)$ failure times are censored. Such data are referred to as type II censored data. Assume
that the lifetimes of the $n$ units are independent and identically distributed as Exp($\lambda$). Then the likelihood
function of $\lambda$ given the ordered failure times $t(1) < t(2) < \cdots < t(r)$ is

$$L(\lambda) = \frac{n!}{(n-r)!} \lambda^{-r} \exp\left(-\frac{\sum_{i=1}^{r} t(i) + (n-r)\,t(r)}{\lambda}\right) \qquad (10)$$

By the factorization theorem, a sufficient statistic for $\lambda$ based on the type II censored data is given by

$$S(typeIIcen) = \sum_{i=1}^{r} t(i) + (n-r)\,t(r) \qquad (11)$$

The amount of information in the type II censored data, $E(S(typeIIcen))$, can be found using the
following theorem.

Theorem (Peter, 2002, p. 129): If $t(1) < t(2) < \cdots < t(r)$ are the first $r$ ordered lifetimes of $n$ units whose lifetimes are identically distributed as Exp($\lambda$), then

$$Y = \frac{2\left(\sum_{i=1}^{r} t(i) + (n-r)\,t(r)\right)}{\lambda}$$

has a $\chi^2(2r)$ distribution.

This implies $E(S(typeIIcen)) = r\lambda$, since a $\chi^2(2r)$ variable has mean $2r$. Therefore, the relative loss of information due to type II censoring is

$$R(typeIIcen) = \frac{n-r}{n} = \text{the proportion of censored data} \qquad (12)$$

and the efficiency of type II censored data is

$$eff(typeIIcen) = \frac{r}{n} = \text{the proportion of observable data} \qquad (13)$$
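Since the Fisher information in the type II censored data is given by

$$I(\lambda) = -E\left(\frac{\partial^2 \log L(\lambda)}{\partial \lambda^2}\right) = \frac{r}{\lambda^2},$$

the relative loss of Fisher information due to type II censoring is given by

$$\frac{n/\lambda^2 - r/\lambda^2}{n/\lambda^2} = \frac{n-r}{n} = R(typeIIcen) \qquad (14)$$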
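
The result $E(S(typeIIcen)) = r\lambda$ can be checked by simulation. The following is a minimal Monte Carlo sketch using only the Python standard library; the function names, the seed, the number of replications and the values $n = 20$, $r = 14$, $\lambda = 3$ are illustrative assumptions.

```python
import random


def type2_statistic(lifetimes, r):
    """S = sum of the r smallest lifetimes + (n - r) * t_(r)."""
    ordered = sorted(lifetimes)
    n = len(ordered)
    return sum(ordered[:r]) + (n - r) * ordered[r - 1]


def simulate_type2_mean(n, r, lam, replications=20000, seed=12345):
    """Monte Carlo estimate of E(S) for Exp(lam) lifetimes with mean lam."""
    rng = random.Random(seed)
    total = 0.0
    for _ in range(replications):
        # random.expovariate expects the rate, i.e. 1 / mean
        sample = [rng.expovariate(1.0 / lam) for _ in range(n)]
        total += type2_statistic(sample, r)
    return total / replications


n, r, lam = 20, 14, 3.0
estimate = simulate_type2_mean(n, r, lam)
efficiency = estimate / (n * lam)  # expected to be close to r / n
```

With these settings the simulated efficiency should lie close to $r/n$, independently of the value chosen for $\lambda$.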

3. TYPE I CENSORED DATA

If the life testing experiment considered in section 2 is instead terminated at a pre-assigned time $T > 0$, then we have type
I censoring, with lifetime data $t(1) < t(2) < \cdots < t(r) < T$. The likelihood function of $\lambda$ is given by

$$L(\lambda) \propto \lambda^{-r} \exp\left(-\frac{\sum_{i=1}^{r} t(i) + (n-r)\,T}{\lambda}\right) \qquad (15)$$

Hence, a sufficient statistic for $\lambda$ is

$$S(typeIcen) = \sum_{i=1}^{r} t(i) + (n-r)\,T \qquad (16)$$

Since $E\left(\sum_{i=1}^{r} t(i) + (n-r)\,T\right) > E\left(\sum_{i=1}^{r} t(i) + (n-r)\,t(r)\right) = r\lambda$, it is clear that, based on the sufficient statistics and with the same
proportion of censored data, the amount of information in type I censored data is greater than the amount of
information in type II censored data. Since

$$E(S(typeIcen)) = E\left(\sum_{i=1}^{r} t(i) + (n-r)\,t(r)\right) + E\left((n-r)(T - t(r))\right),$$

it follows that, compared with type II censored data with $r$ failures, the amount of information is

$$E(S(typeIcen)) = r\lambda + \phi(\lambda), \quad \text{where } \phi(\lambda) = E\left((T - t(r))(n-r)\right).$$

For type I censored data, $r$ is a random variable distributed as $bin\left(n, 1 - \exp(-T/\lambda)\right)$.

Hence $E(r) = n\left(1 - \exp(-T/\lambda)\right)$. Setting $\hat{r} = [E(r)]$, an approximate value of $E(t(\hat{r}))$ can be
obtained from the theory of order statistics as

$$E(t(\hat{r})) = \frac{\lambda\, n!}{(\hat{r}-1)!\,(n-\hat{r})!} \sum_{j=0}^{\hat{r}-1} (-1)^j \binom{\hat{r}-1}{j} (n+j-\hat{r}+1)^{-2} \qquad (17)$$

This implies that $E(S(typeIcen))$ can be approximated by

$$E(S(typeIcen)) \approx \hat{r}\lambda + (n-\hat{r})\left(T - \frac{\lambda\, n!}{(\hat{r}-1)!\,(n-\hat{r})!} \sum_{j=0}^{\hat{r}-1} (-1)^j \binom{\hat{r}-1}{j} (n+j-\hat{r}+1)^{-2}\right)$$

Hence, the relative loss of information due to type I censoring is given by

$$R(typeIcen) = \frac{n\lambda - E(S(typeIcen))}{n\lambda} \qquad (18)$$

and the efficiency of type I censoring is given by

$$eff(typeIcen) = \frac{E(S(typeIcen))}{n\lambda} \qquad (19)$$
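
The order-statistic mean in (17) and the resulting approximation to $E(S(typeIcen))$ can be sketched as follows. The alternating sum is evaluated exactly with rational arithmetic and compared with the equivalent closed form $\sum_{i=1}^{r} 1/(n-i+1)$. The function names, the handling of the case $\hat{r} = 0$ and the example values ($n = 20$, $\lambda = 3$, $T = 2$) are assumptions made for illustration.

```python
from fractions import Fraction
from math import comb, exp, factorial, floor


def order_stat_mean_unit(n: int, r: int) -> Fraction:
    """E(t_(r)) / lam from the alternating sum in (17), computed exactly."""
    const = Fraction(factorial(n), factorial(r - 1) * factorial(n - r))
    series = sum(
        Fraction((-1) ** j * comb(r - 1, j), (n + j - r + 1) ** 2)
        for j in range(r)
    )
    return const * series


def order_stat_mean_unit_harmonic(n: int, r: int) -> Fraction:
    """Equivalent closed form: sum over i = 1..r of 1 / (n - i + 1)."""
    return sum(Fraction(1, n - i + 1) for i in range(1, r + 1))


def approx_type1_information(n: int, lam: float, T: float) -> float:
    """Approximate E(S) as r_hat * lam + (n - r_hat) * (T - E(t_(r_hat)))."""
    r_hat = floor(n * (1.0 - exp(-T / lam)))  # integer part of E(r)
    if r_hat < 1:
        # Assumption: with no expected failure before T, S reduces to n * T.
        return n * T
    e_tr = lam * float(order_stat_mean_unit(n, r_hat))
    return r_hat * lam + (n - r_hat) * (T - e_tr)


n, lam, T = 20, 3.0, 2.0  # hypothetical design values
info = approx_type1_information(n, lam, T)
eff = info / (n * lam)
loss = 1.0 - eff
```

Exact rational arithmetic is used for the alternating sum because its terms grow large and cancel, which can cost precision in floating point for larger $n$.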

4. PROGRESSIVELY TYPE I CENSORED DATA

The progressively type I censored data are obtained from the life test as follows. Assume that $k > 2$ inspection
times $0 < \tau_1 < \tau_2 < \cdots < \tau_k < \infty$ are predetermined and $n$ units are put on the life test at time 0. At the
inspection time $\tau_1$, the $d_1$ failed units in the interval $(0, \tau_1]$ are recorded, and $R_1$ of the $n - d_1$ surviving units are randomly
removed. Continuing the test, at time $\tau_2$, the $d_2$ failed units in the interval $(\tau_1, \tau_2]$ are recorded, and $R_2$ of the $n - d_1 - d_2 - R_1$
surviving units are randomly selected and removed. Finally, at the censoring time $\tau_k$, the $d_k$ failed units in the interval $(\tau_{k-1}, \tau_k]$ are
recorded, all the surviving units $R_k = n - \sum_{j=1}^{k} d_j - \sum_{j=1}^{k-1} R_j$ are removed, and the test is stopped.

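The likelihood function of $\lambda$ based on the progressively type I censored data
$(d_1, d_2, \ldots, d_k, R_1, R_2, \ldots, R_{k-1})$ is given by

$$L(\lambda) = C \prod_{j=1}^{k} \left(1 - \exp\left(-\frac{\tau_j - \tau_{j-1}}{\lambda}\right)\right)^{d_j} \exp\left(-\frac{\sum_{j=1}^{k} \tau_{j-1} d_j + \sum_{j=1}^{k} \tau_j R_j}{\lambda}\right) \qquad (20)$$

where $\tau_0 = 0$. Hence, a sufficient statistic for $\lambda$ is

$$S(PtypeIcen) = \sum_{j=1}^{k} \tau_{j-1} d_j + \sum_{j=1}^{k} \tau_j R_j$$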

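To find $E(S(PtypeIcen))$, we have to find $E(d_j)$ and $E(R_j)$, $j = 1, 2, \ldots, k$.

Let $q_j(\lambda)$ be the conditional probability of failure in the interval $(\tau_{j-1}, \tau_j]$ given survival to time $\tau_{j-1}$, with $\tau_0 = 0$, and
assume $\tau_j - \tau_{j-1} = \delta$ for $j = 2, \ldots, k$. Then

$$q_j(\lambda) = P(\tau_{j-1} < T < \tau_j \mid T > \tau_{j-1}) = \begin{cases} 1 - \exp\left(-\tau_1/\lambda\right), & j = 1 \\ 1 - \exp\left(-\delta/\lambda\right), & j = 2, \ldots, k \end{cases}$$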
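This implies:

$d_1$ is distributed as $bin(n, q_1(\lambda))$,

$(d_j \mid d_{j-1}, \ldots, d_1, R_{j-1}, \ldots, R_1)$ is distributed as $bin\left(n - \sum_{i=1}^{j-1} d_i - \sum_{i=1}^{j-1} R_i,\ q_j(\lambda)\right)$.

If the $R_j$ are proportions of the remaining surviving units, i.e. $R_j = \left(n - \sum_{i=1}^{j} d_i - \sum_{i=1}^{j-1} R_i\right) p_j$, $j = 1, 2, \ldots, k-1$, and
$p_k = 1$, then by induction:

$$E(d_1) = n\left(1 - \exp(-\tau_1/\lambda)\right), \qquad E(R_1) = n\, p_1 \exp(-\tau_1/\lambda)$$

$$E(d_2) = n\left(1 - \exp(-\delta/\lambda)\right) \exp(-\tau_1/\lambda)\,(1 - p_1)$$

$$E(R_j) = p_j \left[n - \sum_{i=1}^{j} E(d_i) - \sum_{i=1}^{j-1} E(R_i)\right], \quad j = 2, 3, \ldots, k$$

$$E(d_j) = q_j(\lambda) \left[n - \sum_{i=1}^{j-1} E(d_i) - \sum_{i=1}^{j-1} E(R_i)\right], \quad j = 3, 4, \ldots, k$$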



Substituting for $E(d_j)$ and $E(R_j)$, $j = 1, 2, \ldots, k$, in (21), the amount of information in progressively type I censored
data is given by

$$E(S(PtypeIcen)) = \sum_{j=1}^{k} \tau_{j-1} E(d_j) + \sum_{j=1}^{k} \tau_j E(R_j), \quad \tau_0 = 0 \qquad (21)$$

Hence, the relative loss of information due to progressively type I censoring is

$$R(PtypeIcen) = \frac{n\lambda - E(S(PtypeIcen))}{n\lambda} \qquad (22)$$

and the relative efficiency of progressively type I censored data is

$$eff(PtypeIcen) = \frac{E(S(PtypeIcen))}{n\lambda} \qquad (23)$$
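
The recursion for the expected numbers of failures and removals, and the resulting amount of information, can be sketched as a single pass over the inspection times. The function name and the design values (equidistant inspections with spacing 1.0, $n = 20$, $\lambda = 3$, removal proportion 0.05) are hypothetical; the paper does not state the inspection times it used.

```python
from math import exp


def progressive_type1_information(n, lam, taus, probs):
    """Expected value of S = sum tau_{j-1} d_j + sum tau_j R_j.

    taus  : inspection times tau_1 < ... < tau_k
    probs : removal proportions p_1, ..., p_k (the last one must be 1)
    Returns (E(S), list of E(d_j), list of E(R_j)).
    """
    if len(taus) != len(probs) or probs[-1] != 1:
        raise ValueError("need one proportion per inspection and p_k = 1")
    at_risk = float(n)  # expected number of units still on test
    prev_tau = 0.0      # tau_0 = 0
    exp_d, exp_r, info = [], [], 0.0
    for tau, p in zip(taus, probs):
        q = 1.0 - exp(-(tau - prev_tau) / lam)  # conditional failure probability
        d = q * at_risk                          # E(d_j)
        at_risk -= d
        r = p * at_risk                          # E(R_j)
        at_risk -= r
        info += prev_tau * d + tau * r
        exp_d.append(d)
        exp_r.append(r)
        prev_tau = tau
    return info, exp_d, exp_r


# Hypothetical design: k = 4 equidistant inspections with spacing delta = 1.0
n, lam, k, delta, p = 20, 3.0, 4, 1.0, 0.05
taus = [delta * j for j in range(1, k + 1)]
probs = [p] * (k - 1) + [1.0]
info, exp_d, exp_r = progressive_type1_information(n, lam, taus, probs)
efficiency = info / (n * lam)
loss = 1.0 - efficiency
```

Because every expectation in the recursion is linear in the expected number of units at risk, tracking that single quantity is enough to obtain all $E(d_j)$ and $E(R_j)$.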

5. PROGRESSIVELY TYPE II CENSORING

Consider an experiment in which $n$ units are placed on a life test. At the time of the first failure,
$R_1$ units are randomly removed from the remaining $n - 1$ surviving units. Similarly, at the time of the second failure, $R_2$
units are randomly removed from the remaining $n - 2 - R_1$ units. The test continues until the $m$-th failure, at which
time all the remaining $R_m = n - m - R_1 - R_2 - \cdots - R_{m-1}$ units are removed and the test is stopped. The
likelihood function of $\lambda$ based on the progressively type II censored data $(t(1) < t(2) < \cdots < t(m), R_1, R_2, \ldots, R_m)$ can be
simplified as

$$L(\lambda) = C \lambda^{-m} \exp\left(-\frac{\sum_{i=1}^{m} (1 + R_i)\, t(i)}{\lambda}\right) \qquad (24)$$

where the constant $C$ is independent of $\lambda$. By the factorization theorem, a sufficient statistic is given by

$$S(PtypeIIcen) = \sum_{i=1}^{m} (1 + R_i)\, t(i) \qquad (25)$$

Here, the $R_i$, $i = 1, 2, \ldots, m$, are either predetermined constants or assumed to be binomial removals with
probability $P_i$, $0 < P_i < 1$, i.e. $R_i$ is distributed as $bin\left(n - m - \sum_{j=1}^{i-1} R_j,\ P_i\right)$. To find $E(S(PtypeIIcen))$, assume that
$t(i)$ and $R_i$, $i = 1, 2, \ldots, m$, are independent. From the theory of order statistics,

$$E(t(i)) = \frac{\lambda\, n!}{(i-1)!\,(n-i)!} \sum_{j=0}^{i-1} (-1)^j \binom{i-1}{j} (n+j-i+1)^{-2} \qquad (26)$$

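Since $E(R_i) = \left(n - m - \sum_{j=1}^{i-1} R_j\right) P_i$, the amount of information in the progressively type II
censored data is

$$E(S(PtypeIIcen)) = \lambda\, n! \sum_{i=1}^{m} \left(1 + \left(n - m - \sum_{j=1}^{i-1} R_j\right) P_i\right) \frac{1}{(i-1)!\,(n-i)!} \sum_{j=0}^{i-1} (-1)^j \binom{i-1}{j} (n+j-i+1)^{-2}$$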

Therefore, the relative loss of information due to progressively type II censoring is given by

$$R(PtypeIIcen) = 1 - (n-1)! \sum_{i=1}^{m} \left(1 + \left(n - m - \sum_{j=1}^{i-1} R_j\right) P_i\right) \frac{1}{(i-1)!\,(n-i)!} \sum_{j=0}^{i-1} (-1)^j \binom{i-1}{j} (n+j-i+1)^{-2} \qquad (27)$$

and the efficiency of the progressively type II censored data is

$$eff(PtypeIIcen) = (n-1)! \sum_{i=1}^{m} \left(1 + \left(n - m - \sum_{j=1}^{i-1} R_j\right) P_i\right) \frac{1}{(i-1)!\,(n-i)!} \sum_{j=0}^{i-1} (-1)^j \binom{i-1}{j} (n+j-i+1)^{-2} \qquad (28)$$

We notice that if $P_i = 0$ for all $i = 1, 2, \ldots, m-1$ and $P_m = 1$, then we have exactly type II censoring.
6. NUMERICAL RESULTS

To illustrate the theoretical findings of this paper, random samples of size $n = 20$ are generated from the Exp($\lambda$) distribution
with $\lambda = 3, 5, 7$. The relative loss of information and efficiency of type I and type II censored data are presented and compared
with the relative efficiency based on the Awad sup-entropy, given by

A cen (M = (1 " exp (- 3 - 1 exp (- ~) (29) 
in Table 1 . 



Table 1: Relative Loss of Information and Efficiency of Type I and Type II Censored Data

$\lambda$   Proportion of censoring   eff(typeIIcen)   R(typeIcen)   eff(typeIcen)   Awad sup-entropy efficiency
3           0.10                      0.90             0.0910        0.9090          0.8366
            0.30                      0.70             0.2914        0.7086          0.5769
            0.50                      0.50             0.4922        0.5078          0.5063
5           0.10                      0.90             0.0906        0.9094          0.8527
            0.30                      0.70             0.2907        0.7093          0.6190
            0.50                      0.50             0.4913        0.5087          0.5580
7           0.10                      0.90             0.0903        0.9097          0.8605
            0.30                      0.70             0.2902        0.7098          0.6372
            0.50                      0.50             0.4909        0.5091          0.5767

It clearly appears that:

(1) As the proportion of censored data increases, the relative loss of information increases and the efficiency
decreases under both types of censoring.

(2) For type I censored data with a fixed proportion of censoring, the relative efficiency increases and the relative
loss of information decreases as the value of the scale parameter increases, while type II censored data are not affected by
the parameter value.

(3) Based on the sufficient statistics, the relative efficiencies of type I censored data are greater than the
corresponding values based on the Awad sup-entropy.

(4) Based on the sufficient statistics, the relative efficiencies of type I censored data are greater than the
corresponding values for type II censored data.

To measure the relative loss of information and efficiency of progressively type I censored data, equidistant
inspection intervals are considered with number of inspections $k = 4, 5, 6$ and fixed probability of removal $P = 0.05, 0.10$,
as presented in Table 2. It is clear that:
Table 2: Relative Loss of Information and Efficiency of Progressively Type I Censored Data

$\lambda$   Probability of removals   Number of inspections (k)   R(PtypeIcen)   eff(PtypeIcen)
3           0.05                      4                           0.3021         0.6976
                                      5                           0.2987         0.7013
                                      6                           0.2965         0.7035
            0.10                      4                           0.3201         0.6799
                                      5                           0.3145         0.6855
                                      6                           0.3008         0.6992
5           0.05                      4                           0.2996         0.7004
                                      5                           0.2934         0.7066
                                      6                           0.2871         0.7129
            0.10                      4                           0.3166         0.6834
                                      5                           0.3067         0.6933
                                      6                           0.2984         0.7016
7           0.05                      4                           0.2878         0.7122
                                      5                           0.2806         0.7294
                                      6                           0.2765         0.7235
            0.10                      4                           0.3001         0.6999
                                      5                           0.2963         0.7037
                                      6                           0.2892         0.7208

(1) The relative loss of information decreases and the relative efficiency increases as the number of inspections increases.

(2) As the probability of removal increases from 0.05 to 0.10, the relative loss of information increases and the
relative efficiency decreases.

(3) For a fixed probability of removal and a fixed number of inspections, the relative loss of information decreases as the value
of the scale parameter increases.

To measure the relative loss of information and efficiency of progressively type II censored data, two, three and
four stages are considered. For the two stages, removals are assumed at the failure times $t(10)$, $t(18)$ with probabilities of
removal $P_{10} = 0.10$, $P_{18} = 1$ in the first case and $P_{10} = 0.10$, $P_{18} = 1$ in the second case. For the three stages,
removals are assumed at the failure times $t(7)$, $t(10)$, $t(18)$ with probabilities of removal $P_7 = 0.05$, $P_{10} = 0.05$, $P_{18} = 1$
in the first case and $P_7 = 0.10$, $P_{10} = 0.10$, $P_{18} = 1$ in the second case. For the four stages, removals are assumed at the
failure times $t(5)$, $t(8)$, $t(15)$, $t(18)$ with probabilities of removal $P_5 = 0.02$, $P_8 = 0.03$, $P_{15} = 0.05$, $P_{18} = 1$ in the
first case and $P_5 = 0.05$, $P_8 = 0.05$, $P_{15} = 0.10$, $P_{18} = 1$ in the second case. All other probabilities of removal are
assumed to be identically 0. The results appear in Table 3. It is clear that:
Table 3: Relative Loss of Information and Efficiency of Progressively Type II Censored Data

Stages   Case     R(PtypeIIcen)   eff(PtypeIIcen)
Two      first    0.1214          0.8786
         second   0.1537          0.8463
Three    first    0.1937          0.8073
         second   0.2185          0.7815
Four     first    0.2285          0.7715
         second   0.2352          0.7648

(1) While type I censoring gave relatively better efficiencies than type II censoring, progressively type II censored data
have higher relative efficiencies than progressively type I censored data with the same proportion of removals. One explanation
is that, under progressive type I censoring, there is an extra loss of information about the exact failure times.

(2) As the probabilities of binomial removal increase, the relative loss of information increases and the relative
efficiency decreases.

(3) With the same binomial removals, the relative loss of information increases and the relative efficiency
decreases as the number of stages of censoring increases; hence, the optimum scheme is the one-stage ordinary type
II censoring.

7. CONCLUSIONS

Based on sufficient statistics, the main factors that affect the relative loss of information and the efficiency of
censored data are the proportion of censoring, the value of the indexing parameter and the length of the life testing experiment.
For type II and progressively type II censoring, the amount of information is essentially determined by the proportion of
censoring and the probabilities of removing units at the different stages of the life test. While type I censoring provides more
information than ordinary type II censoring, progressively type II censoring shows higher relative efficiencies than
progressively type I censoring. This is due to the loss of information about the exact failure times that is specific to
progressively type I censoring.

The overall theoretical and numerical results of this paper suggest that sufficient statistics can serve as a
plausible measure of the amount of information in censored data. The developed methods can be generalized to other
lifetime models using mathematical transformations and modern software tools. Future research may address the planning of
life testing experiments based on sufficient statistics under other censoring mechanisms.